Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 56504 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.0 MiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 13 |
|---|---|
| BOOL | 5 |
Annual Income is highly skewed (γ1 = 55.2850603) | Skewed |
Maximum Open Credit is highly skewed (γ1 = 125.8893154) | Skewed |
df_index has unique values | Unique |
Years in current job has 3659 (6.5%) zeros | Zeros |
Number of Credit Problems has 48805 (86.4%) zeros | Zeros |
Bankruptcies has 50309 (89.0%) zeros | Zeros |
Tax Liens has 55424 (98.1%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-13 12:48:10.088579 |
|---|---|
| Analysis finished | 2020-09-13 12:48:47.189701 |
| Duration | 37.1 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 56504 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35309.86162 |
|---|---|
| Minimum | 1 |
| Maximum | 70629 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3536.15 |
| Q1 | 17647.75 |
| median | 35342 |
| Q3 | 53008.25 |
| 95-th percentile | 67107.85 |
| Maximum | 70629 |
| Range | 70628 |
| Interquartile range (IQR) | 35360.5 |
Descriptive statistics
| Standard deviation | 20393.53144 |
|---|---|
| Coefficient of variation (CV) | 0.5775590871 |
| Kurtosis | -1.201611937 |
| Mean | 35309.86162 |
| Median Absolute Deviation (MAD) | 17680.5 |
| Skewness | -0.0002610777149 |
| Sum | 1995148421 |
| Variance | 415896124.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 70327 | 1 | < 0.1% | |
| 62155 | 1 | < 0.1% | |
| 64202 | 1 | < 0.1% | |
| 58057 | 1 | < 0.1% | |
| 60104 | 1 | < 0.1% | |
| 37575 | 1 | < 0.1% | |
| 39622 | 1 | < 0.1% | |
| 35524 | 1 | < 0.1% | |
| 45763 | 1 | < 0.1% | |
| Other values (56494) | 56494 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 70629 | 1 | < 0.1% | |
| 70627 | 1 | < 0.1% | |
| 70626 | 1 | < 0.1% | |
| 70624 | 1 | < 0.1% | |
| 70621 | 1 | < 0.1% |
Current Loan Amount
Real number (ℝ≥0)
| Distinct | 18718 |
|---|---|
| Distinct (%) | 33.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16531110.15 |
|---|---|
| Minimum | 11242 |
| Maximum | 99999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 11242 |
|---|---|
| 5-th percentile | 78697.3 |
| Q1 | 191026 |
| median | 327250 |
| Q3 | 569706.5 |
| 95-th percentile | 99999999 |
| Maximum | 99999999 |
| Range | 99988757 |
| Interquartile range (IQR) | 378680.5 |
Descriptive statistics
| Standard deviation | 36794308.67 |
|---|---|
| Coefficient of variation (CV) | 2.225761509 |
| Kurtosis | 1.340791018 |
| Mean | 16531110.15 |
| Median Absolute Deviation (MAD) | 165924 |
| Skewness | 1.827727633 |
| Sum | 9.34073848e+11 |
| Variance | 1.35382115e+15 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 99999999 | 9193 | 16.3% | |
| 223322 | 19 | < 0.1% | |
| 262724 | 16 | < 0.1% | |
| 110374 | 15 | < 0.1% | |
| 174372 | 15 | < 0.1% | |
| 223674 | 15 | < 0.1% | |
| 219648 | 14 | < 0.1% | |
| 217646 | 14 | < 0.1% | |
| 216546 | 13 | < 0.1% | |
| 216810 | 13 | < 0.1% | |
| Other values (18708) | 47177 | 83.5% |
| Value | Count | Frequency (%) | |
| 11242 | 1 | < 0.1% | |
| 21450 | 2 | < 0.1% | |
| 21472 | 6 | < 0.1% | |
| 21494 | 1 | < 0.1% | |
| 21516 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99999999 | 9193 | 16.3% | |
| 789250 | 2 | < 0.1% | |
| 789184 | 3 | < 0.1% | |
| 789096 | 8 | < 0.1% | |
| 789030 | 6 | < 0.1% |
Credit Score
Real number (ℝ≥0)
| Distinct | 321 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1125.161174 |
|---|---|
| Minimum | 585 |
| Maximum | 7510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 585 |
|---|---|
| 5-th percentile | 664 |
| Q1 | 708 |
| median | 729 |
| Q3 | 742 |
| 95-th percentile | 6890 |
| Maximum | 7510 |
| Range | 6925 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 1560.583029 |
|---|---|
| Coefficient of variation (CV) | 1.386986207 |
| Kurtosis | 10.93415184 |
| Mean | 1125.161174 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 3.590480477 |
| Sum | 63576107 |
| Variance | 2435419.392 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 747 | 1460 | 2.6% | |
| 740 | 1432 | 2.5% | |
| 746 | 1416 | 2.5% | |
| 741 | 1381 | 2.4% | |
| 742 | 1378 | 2.4% | |
| 739 | 1318 | 2.3% | |
| 745 | 1278 | 2.3% | |
| 748 | 1271 | 2.2% | |
| 743 | 1222 | 2.2% | |
| 738 | 1203 | 2.1% | |
| Other values (311) | 43145 | 76.4% |
| Value | Count | Frequency (%) | |
| 585 | 8 | < 0.1% | |
| 586 | 5 | < 0.1% | |
| 587 | 7 | < 0.1% | |
| 588 | 8 | < 0.1% | |
| 589 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7510 | 8 | < 0.1% | |
| 7500 | 17 | < 0.1% | |
| 7490 | 15 | < 0.1% | |
| 7480 | 38 | 0.1% | |
| 7470 | 38 | 0.1% |
| Distinct | 31718 |
|---|---|
| Distinct (%) | 56.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1377067.953 |
|---|---|
| Minimum | 81092 |
| Maximum | 165557393 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 81092 |
|---|---|
| 5-th percentile | 521778 |
| Q1 | 848112.5 |
| median | 1171882 |
| Q3 | 1650211.75 |
| 95-th percentile | 2808105 |
| Maximum | 165557393 |
| Range | 165476301 |
| Interquartile range (IQR) | 802099.25 |
Descriptive statistics
| Standard deviation | 1144954.343 |
|---|---|
| Coefficient of variation (CV) | 0.8314436044 |
| Kurtosis | 7525.605284 |
| Mean | 1377067.953 |
| Median Absolute Deviation (MAD) | 380038 |
| Skewness | 55.2850603 |
| Sum | 7.780984764e+10 |
| Variance | 1.310920447e+12 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1162572 | 16 | < 0.1% | |
| 1140000 | 15 | < 0.1% | |
| 1146612 | 14 | < 0.1% | |
| 969475 | 13 | < 0.1% | |
| 1143762 | 13 | < 0.1% | |
| 931190 | 12 | < 0.1% | |
| 1128486 | 11 | < 0.1% | |
| 973370 | 11 | < 0.1% | |
| 1126320 | 11 | < 0.1% | |
| 1139430 | 11 | < 0.1% | |
| Other values (31708) | 56377 | 99.8% |
| Value | Count | Frequency (%) | |
| 81092 | 1 | < 0.1% | |
| 94867 | 1 | < 0.1% | |
| 97033 | 1 | < 0.1% | |
| 106533 | 1 | < 0.1% | |
| 111245 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 165557393 | 1 | < 0.1% | |
| 36475440 | 1 | < 0.1% | |
| 28095300 | 1 | < 0.1% | |
| 24161540 | 1 | < 0.1% | |
| 22448880 | 2 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.630114682 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 3659 |
| Zeros (%) | 6.5% |
| Memory size | 220.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.157234963 |
|---|---|
| Coefficient of variation (CV) | 0.8697342203 |
| Kurtosis | -0.7757301789 |
| Mean | 3.630114682 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7625970267 |
| Sum | 205116 |
| Variance | 9.96813261 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 19983 | 35.4% | |
| 2 | 5212 | 9.2% | |
| 3 | 4609 | 8.2% | |
| 10 | 4576 | 8.1% | |
| 5 | 3852 | 6.8% | |
| 0 | 3659 | 6.5% | |
| 4 | 3427 | 6.1% | |
| 6 | 3206 | 5.7% | |
| 7 | 3159 | 5.6% | |
| 8 | 2584 | 4.6% |
| Value | Count | Frequency (%) | |
| 0 | 3659 | 6.5% | |
| 1 | 19983 | 35.4% | |
| 2 | 5212 | 9.2% | |
| 3 | 4609 | 8.2% | |
| 4 | 3427 | 6.1% |
| Value | Count | Frequency (%) | |
| 10 | 4576 | 8.1% | |
| 9 | 2237 | 4.0% | |
| 8 | 2584 | 4.6% | |
| 7 | 3159 | 5.6% | |
| 6 | 3206 | 5.7% |
Monthly Debt
Real number (ℝ≥0)
| Distinct | 46667 |
|---|---|
| Distinct (%) | 82.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18438.89051 |
|---|---|
| Minimum | 0 |
| Maximum | 229057.92 |
| Zeros | 42 |
| Zeros (%) | 0.1% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3771.12 |
| Q1 | 10188.37 |
| median | 16172.99 |
| Q3 | 23939.7625 |
| 95-th percentile | 40452.197 |
| Maximum | 229057.92 |
| Range | 229057.92 |
| Interquartile range (IQR) | 13751.3925 |
Descriptive statistics
| Standard deviation | 12132.89272 |
|---|---|
| Coefficient of variation (CV) | 0.6580055732 |
| Kurtosis | 9.896131914 |
| Mean | 18438.89051 |
| Median Absolute Deviation (MAD) | 6708.33 |
| Skewness | 1.927606688 |
| Sum | 1041871069 |
| Variance | 147207085.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 42 | 0.1% | |
| 10647.98 | 6 | < 0.1% | |
| 13359.85 | 6 | < 0.1% | |
| 12656.47 | 6 | < 0.1% | |
| 14535 | 5 | < 0.1% | |
| 12907.08 | 5 | < 0.1% | |
| 13520.78 | 5 | < 0.1% | |
| 7538.82 | 5 | < 0.1% | |
| 15632.06 | 5 | < 0.1% | |
| 11233.75 | 5 | < 0.1% | |
| Other values (46657) | 56414 | 99.8% |
| Value | Count | Frequency (%) | |
| 0 | 42 | 0.1% | |
| 19.57 | 1 | < 0.1% | |
| 28.5 | 1 | < 0.1% | |
| 34.96 | 2 | < 0.1% | |
| 52.63 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 229057.92 | 1 | < 0.1% | |
| 205801.35 | 1 | < 0.1% | |
| 173265.56 | 1 | < 0.1% | |
| 172156.15 | 1 | < 0.1% | |
| 165810.53 | 1 | < 0.1% |
Years of Credit History
Real number (ℝ≥0)
| Distinct | 496 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.27673262 |
|---|---|
| Minimum | 3.8 |
| Maximum | 70.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 13.5 |
| median | 17 |
| Q3 | 21.8 |
| 95-th percentile | 31.8 |
| Maximum | 70.5 |
| Range | 66.7 |
| Interquartile range (IQR) | 8.3 |
Descriptive statistics
| Standard deviation | 7.044508895 |
|---|---|
| Coefficient of variation (CV) | 0.3854359005 |
| Kurtosis | 1.760107442 |
| Mean | 18.27673262 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.076234182 |
| Sum | 1032708.5 |
| Variance | 49.62510558 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 16 | 774 | 1.4% | |
| 15 | 739 | 1.3% | |
| 17 | 691 | 1.2% | |
| 16.5 | 642 | 1.1% | |
| 14 | 639 | 1.1% | |
| 15.4 | 601 | 1.1% | |
| 13 | 589 | 1.0% | |
| 17.5 | 560 | 1.0% | |
| 18 | 559 | 1.0% | |
| 14.5 | 528 | 0.9% | |
| Other values (486) | 50182 | 88.8% |
| Value | Count | Frequency (%) | |
| 3.8 | 1 | < 0.1% | |
| 3.9 | 2 | < 0.1% | |
| 4 | 4 | < 0.1% | |
| 4.1 | 5 | < 0.1% | |
| 4.2 | 13 | < 0.1% |
| Value | Count | Frequency (%) | |
| 70.5 | 1 | < 0.1% | |
| 65 | 1 | < 0.1% | |
| 60.5 | 1 | < 0.1% | |
| 59.9 | 1 | < 0.1% | |
| 59.7 | 1 | < 0.1% |
Number of Open Accounts
Real number (ℝ≥0)
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.14807801 |
|---|---|
| Minimum | 1 |
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 10 |
| Q3 | 14 |
| 95-th percentile | 20 |
| Maximum | 76 |
| Range | 75 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.037188975 |
|---|---|
| Coefficient of variation (CV) | 0.4518437142 |
| Kurtosis | 3.434546278 |
| Mean | 11.14807801 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.212730089 |
| Sum | 629911 |
| Variance | 25.37327277 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 9 | 5318 | 9.4% | |
| 10 | 5079 | 9.0% | |
| 8 | 4959 | 8.8% | |
| 11 | 4885 | 8.6% | |
| 7 | 4570 | 8.1% | |
| 12 | 4210 | 7.5% | |
| 6 | 3848 | 6.8% | |
| 13 | 3479 | 6.2% | |
| 14 | 2912 | 5.2% | |
| 5 | 2631 | 4.7% | |
| Other values (40) | 14613 | 25.9% |
| Value | Count | Frequency (%) | |
| 1 | 10 | < 0.1% | |
| 2 | 260 | 0.5% | |
| 3 | 758 | 1.3% | |
| 4 | 1604 | 2.8% | |
| 5 | 2631 | 4.7% |
| Value | Count | Frequency (%) | |
| 76 | 2 | < 0.1% | |
| 56 | 2 | < 0.1% | |
| 52 | 2 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 47 | 2 | < 0.1% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.164271556 |
|---|---|
| Minimum | 0 |
| Maximum | 15 |
| Zeros | 48805 |
| Zeros (%) | 86.4% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4808645924 |
|---|---|
| Coefficient of variation (CV) | 2.92725414 |
| Kurtosis | 59.04587267 |
| Mean | 0.164271556 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.20706503 |
| Sum | 9282 |
| Variance | 0.2312307562 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 48805 | 86.4% | |
| 1 | 6664 | 11.8% | |
| 2 | 719 | 1.3% | |
| 3 | 199 | 0.4% | |
| 4 | 65 | 0.1% | |
| 5 | 29 | 0.1% | |
| 6 | 8 | < 0.1% | |
| 7 | 7 | < 0.1% | |
| 8 | 3 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 48805 | 86.4% | |
| 1 | 6664 | 11.8% | |
| 2 | 719 | 1.3% | |
| 3 | 199 | 0.4% | |
| 4 | 65 | 0.1% |
| Value | Count | Frequency (%) | |
| 15 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
Current Credit Balance
Real number (ℝ≥0)
| Distinct | 27211 |
|---|---|
| Distinct (%) | 48.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 295003.1523 |
|---|---|
| Minimum | 0 |
| Maximum | 32878968 |
| Zeros | 332 |
| Zeros (%) | 0.6% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30628 |
| Q1 | 113164 |
| median | 211156.5 |
| Q3 | 368053.75 |
| 95-th percentile | 759613.35 |
| Maximum | 32878968 |
| Range | 32878968 |
| Interquartile range (IQR) | 254889.75 |
Descriptive statistics
| Standard deviation | 386236.9245 |
|---|---|
| Coefficient of variation (CV) | 1.30926372 |
| Kurtosis | 1023.610304 |
| Mean | 295003.1523 |
| Median Absolute Deviation (MAD) | 115206.5 |
| Skewness | 17.86431526 |
| Sum | 1.666885812e+10 |
| Variance | 1.491789619e+11 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 332 | 0.6% | |
| 131955 | 11 | < 0.1% | |
| 124013 | 10 | < 0.1% | |
| 137807 | 10 | < 0.1% | |
| 74765 | 10 | < 0.1% | |
| 110219 | 10 | < 0.1% | |
| 173261 | 10 | < 0.1% | |
| 100301 | 10 | < 0.1% | |
| 118009 | 10 | < 0.1% | |
| 153577 | 10 | < 0.1% | |
| Other values (27201) | 56081 | 99.3% |
| Value | Count | Frequency (%) | |
| 0 | 332 | 0.6% | |
| 19 | 3 | < 0.1% | |
| 38 | 6 | < 0.1% | |
| 57 | 3 | < 0.1% | |
| 76 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 32878968 | 1 | < 0.1% | |
| 12986956 | 1 | < 0.1% | |
| 12746397 | 1 | < 0.1% | |
| 11796435 | 1 | < 0.1% | |
| 11361924 | 1 | < 0.1% |
| Distinct | 34890 |
|---|---|
| Distinct (%) | 61.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 727374.6498 |
|---|---|
| Minimum | 0 |
| Maximum | 798255370 |
| Zeros | 398 |
| Zeros (%) | 0.7% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 110619.3 |
| Q1 | 276386 |
| median | 473924 |
| Q3 | 792203.5 |
| 95-th percentile | 1662845.8 |
| Maximum | 798255370 |
| Range | 798255370 |
| Interquartile range (IQR) | 515817.5 |
Descriptive statistics
| Standard deviation | 4396453.638 |
|---|---|
| Coefficient of variation (CV) | 6.044276686 |
| Kurtosis | 20536.39908 |
| Mean | 727374.6498 |
| Median Absolute Deviation (MAD) | 234454 |
| Skewness | 125.8893154 |
| Sum | 4.109957722e+10 |
| Variance | 1.932880459e+13 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 398 | 0.7% | |
| 150194 | 9 | < 0.1% | |
| 201630 | 9 | < 0.1% | |
| 376992 | 8 | < 0.1% | |
| 107404 | 8 | < 0.1% | |
| 219802 | 8 | < 0.1% | |
| 163064 | 8 | < 0.1% | |
| 152812 | 8 | < 0.1% | |
| 305580 | 8 | < 0.1% | |
| 290246 | 8 | < 0.1% | |
| Other values (34880) | 56032 | 99.2% |
| Value | Count | Frequency (%) | |
| 0 | 398 | 0.7% | |
| 4334 | 2 | < 0.1% | |
| 4444 | 1 | < 0.1% | |
| 6446 | 3 | < 0.1% | |
| 6468 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 798255370 | 1 | < 0.1% | |
| 380052288 | 1 | < 0.1% | |
| 265512874 | 1 | < 0.1% | |
| 192284158 | 1 | < 0.1% | |
| 162187168 | 1 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1148123978 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 50309 |
| Zeros (%) | 89.0% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3466747767 |
|---|---|
| Coefficient of variation (CV) | 3.01948904 |
| Kurtosis | 19.3620556 |
| Mean | 0.1148123978 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.557931492 |
| Sum | 6487.359723 |
| Variance | 0.1201834008 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 50309 | 89.0% | |
| 1 | 5784 | 10.2% | |
| 2 | 224 | 0.4% | |
| 0.1161715048 | 115 | 0.2% | |
| 3 | 54 | 0.1% | |
| 4 | 13 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 50309 | 89.0% | |
| 0.1161715048 | 115 | 0.2% | |
| 1 | 5784 | 10.2% | |
| 2 | 224 | 0.4% | |
| 3 | 54 | 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 4 | 13 | < 0.1% | |
| 3 | 54 | 0.1% |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02874339992 |
|---|---|
| Minimum | 0 |
| Maximum | 15 |
| Zeros | 55424 |
| Zeros (%) | 98.1% |
| Memory size | 441.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2584199928 |
|---|---|
| Coefficient of variation (CV) | 8.990585441 |
| Kurtosis | 489.7666231 |
| Mean | 0.02874339992 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.84323624 |
| Sum | 1624.117069 |
| Variance | 0.06678089269 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 55424 | 98.1% | |
| 1 | 760 | 1.3% | |
| 2 | 208 | 0.4% | |
| 3 | 53 | 0.1% | |
| 4 | 29 | 0.1% | |
| 5 | 10 | < 0.1% | |
| 7 | 6 | < 0.1% | |
| 6 | 6 | < 0.1% | |
| 0.02926725664 | 4 | < 0.1% | |
| 15 | 1 | < 0.1% | |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 55424 | 98.1% | |
| 0.02926725664 | 4 | < 0.1% | |
| 1 | 760 | 1.3% | |
| 2 | 208 | 0.4% | |
| 3 | 53 | 0.1% |
| Value | Count | Frequency (%) | |
| 15 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 7 | 6 | < 0.1% |
Term_Long Term
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 41127 | 72.8% | |
| 1 | 15377 | 27.2% |
Home Ownership_HaveMortgage
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.2 KiB |
| 0 | |
|---|---|
| 1 | 118 |
| Value | Count | Frequency (%) | |
| 0 | 56386 | 99.8% | |
| 1 | 118 | 0.2% |
Home Ownership_Own Home
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 51289 | 90.8% | |
| 1 | 5215 | 9.2% |
Home Ownership_Rent
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 32709 | 57.9% | |
| 1 | 23795 | 42.1% |
Loan Status
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 441.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 42736 | 75.6% | |
| 1 | 13768 | 24.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | Current Loan Amount | Credit Score | Annual Income | Years in current job | Monthly Debt | Years of Credit History | Number of Open Accounts | Number of Credit Problems | Current Credit Balance | Maximum Open Credit | Bankruptcies | Tax Liens | Term_Long Term | Home Ownership_HaveMortgage | Home Ownership_Own Home | Home Ownership_Rent | Loan Status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 59309 | 432256.0 | 685.0 | 895945.0 | 1 | 22473.39 | 17.4 | 17.0 | 0.0 | 451497.0 | 557348.0 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 1 |
| 1 | 38785 | 172436.0 | 697.0 | 1023891.0 | 7 | 13054.52 | 16.4 | 5.0 | 0.0 | 17233.0 | 28050.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 |
| 2 | 43839 | 773388.0 | 709.0 | 1851113.0 | 1 | 32008.73 | 29.2 | 8.0 | 0.0 | 1515592.0 | 2321308.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 13694 | 324522.0 | 691.0 | 1609680.0 | 7 | 35010.35 | 14.1 | 17.0 | 0.0 | 330638.0 | 539242.0 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 0 |
| 4 | 27637 | 99999999.0 | 733.0 | 507110.0 | 5 | 2823.02 | 47.4 | 7.0 | 0.0 | 72181.0 | 97856.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 49013 | 137280.0 | 748.0 | 592800.0 | 10 | 5982.34 | 11.0 | 9.0 | 0.0 | 58748.0 | 203654.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
| 6 | 29378 | 429308.0 | 715.0 | 1942066.0 | 6 | 30425.84 | 14.7 | 13.0 | 0.0 | 369702.0 | 795696.0 | 0.0 | 0.0 | 1 | 0 | 0 | 1 | 0 |
| 7 | 15 | 153252.0 | 714.0 | 1890690.0 | 2 | 21900.35 | 15.7 | 12.0 | 0.0 | 891594.0 | 1081014.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
| 8 | 64489 | 99999999.0 | 740.0 | 768664.0 | 6 | 17935.43 | 16.5 | 15.0 | 0.0 | 209931.0 | 336182.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 46915 | 172722.0 | 733.0 | 1212029.0 | 1 | 12524.42 | 16.1 | 18.0 | 0.0 | 90383.0 | 118800.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
Last rows
| df_index | Current Loan Amount | Credit Score | Annual Income | Years in current job | Monthly Debt | Years of Credit History | Number of Open Accounts | Number of Credit Problems | Current Credit Balance | Maximum Open Credit | Bankruptcies | Tax Liens | Term_Long Term | Home Ownership_HaveMortgage | Home Ownership_Own Home | Home Ownership_Rent | Loan Status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 56494 | 4058 | 169356.0 | 691.0 | 493848.0 | 4 | 9630.15 | 7.0 | 4.0 | 0.0 | 74537.0 | 76978.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
| 56495 | 3342 | 334554.0 | 744.0 | 3544241.0 | 7 | 46370.45 | 14.6 | 14.0 | 0.0 | 304380.0 | 2553914.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 |
| 56496 | 61196 | 85932.0 | 7180.0 | 519498.0 | 5 | 9610.77 | 14.7 | 7.0 | 0.0 | 123253.0 | 180422.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
| 56497 | 53496 | 380930.0 | 725.0 | 2225432.0 | 1 | 17432.50 | 12.6 | 12.0 | 1.0 | 166383.0 | 412544.0 | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 |
| 56498 | 12069 | 322212.0 | 709.0 | 779171.0 | 9 | 5161.92 | 11.9 | 6.0 | 0.0 | 106495.0 | 206184.0 | 0.0 | 0.0 | 1 | 0 | 0 | 1 | 0 |
| 56499 | 59829 | 557040.0 | 731.0 | 1731888.0 | 1 | 35936.60 | 25.9 | 16.0 | 0.0 | 636633.0 | 1486232.0 | 0.0 | 0.0 | 1 | 0 | 0 | 0 | 1 |
| 56500 | 849 | 98406.0 | 684.0 | 660953.0 | 3 | 4742.40 | 17.4 | 8.0 | 0.0 | 153121.0 | 244882.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 |
| 56501 | 70352 | 334466.0 | 701.0 | 1390648.0 | 5 | 27581.16 | 18.3 | 12.0 | 0.0 | 220780.0 | 289828.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 |
| 56502 | 28036 | 186252.0 | 736.0 | 1230060.0 | 1 | 14043.09 | 14.8 | 6.0 | 0.0 | 178220.0 | 383592.0 | 0.0 | 0.0 | 0 | 0 | 1 | 0 | 0 |
| 56503 | 62511 | 171666.0 | 745.0 | 1294869.0 | 0 | 20070.65 | 9.0 | 24.0 | 0.0 | 169860.0 | 455246.0 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 |